Efficient Snapshot Differential Algorithms for Data Warehousing
نویسندگان
چکیده
Detecting and extracting modifications from information sources is an integral part of data warehousing. For unsophisticated sources, it is often necessary to infer modifications by periodically comparing snap shots of data from the source. Although this snapshot differential problena is closely related to traditional joins, there are significant differences, which lead to simple new algorithms. In particular, we present algorithms that perform compression of records. We also present a window algorithm that works. very well if the snapshots are not “very different.” The algorithms are studied via analysis and an implementation of two of them;‘the results illustrate the potential gains achievable with the new’ algorithms.
منابع مشابه
Eecient Snapshot Diierential Algorithms for Data Warehousing
Detecting and extracting modi cations from information sources is an integral part of data warehousing. For unsophisticated sources, it is often necessary to infer modi cations by periodically comparing snapshots of data from the source. Although this snapshot di erential problem is closely related to traditional joins, there are significant di erences, which lead to simple new algorithms. In p...
متن کاملDi erential Algorithms for Data Warehousing
Detecting and extracting modi cations from information sources is an integral part of data warehousing. For unsophisticated sources, in practice it is often necessary to infer modi cations by periodically comparing snapshots of data from the source. Although this snapshot di erential problem is closely related to traditional joins and outerjoins, there are signi cant di erences, which lead to s...
متن کاملComparing Very Large Database Snapshots
Detecting and extracting modi cations from information sources is an integral part of data warehousing. For unsophisticated sources, in practice it is often necessary to infer modi cations by periodically comparing snapshots of data from the source. We call this problem the snapshot di erential problem. We show that this is closely related to outerjoins. In this paper we extend the traditional ...
متن کاملEfficient Data Mining with Evolutionary Algorithms for Cloud Computing Application
With the rapid development of the internet, the amount of information and data which are produced, are extremely massive. Hence, client will be confused with huge amount of data, and it is difficult to understand which ones are useful. Data mining can overcome this problem. While data mining is using on cloud computing, it is reducing time of processing, energy usage and costs. As the speed of ...
متن کاملA Solution to View Management to Build a Data Warehouse
Several techniques exist to select and materialize a proper set of data in a suitable structure that manage the queries submitted to the online analytical processing systems. These techniques are called view management techniques, which consist of three research areas: 1) view selection to materialize, 2) query processing and rewriting using the materialized views, and 3) maintaining materializ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996